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Abstract 

The available taxonomic expertise and knowledge of species is still inadequate 
to cope with the urgent need for cost-effective methods to quantifying commu- 
nity response to natural and anthropogenic drivers of change. So far, the main- 
stream approach to overcome these impediments has focused on using higher 
taxa as surrogates for species. However, the use of such taxonomic surrogates 
often limits inferences about the causality of community patterns, which in 
turn is essential for effective environmental management strategies. Here, we 
propose an alternative approach to species surrogacy, the "Best Practicable 
Aggregation of Species" (BestAgg), in which surrogates exulate from fixed taxo- 
nomic schemes. The approach uses null models from random aggregations of 
species to minimizing the number of surrogates without causing significant 
losses of information on community patterns. Surrogate types are then selected 
in order to maximize ecological information. We applied the approach to real 
case studies on natural and human-driven gradients from marine benthic com- 
munities. Outcomes from BestAgg were also compared with those obtained 
using classic taxonomic surrogates. Results showed that BestAgg surrogates are 
effective in detecting community changes. In contrast to classic taxonomic sur- 
rogates, BestAgg surrogates allow retaining significantly higher information on 
species-level community patterns than what is expected to occur by chance and 
a potential time saving during sample processing up to 25% higher. Our find- 
ings showed that BestAgg surrogates from a pilot study could be used success- 
fully in similar environmental investigations in the same area, or for subsequent 
long-term monitoring programs. BestAgg is virtually applicable to any environ- 
mental context, allowing exploiting multiple surrogacy schemes beyond stag- 
nant perspectives strictly relying on taxonomic relatedness among species. This 
prerogative is crucial to extend the concept of species surrogacy to ecological 
traits of species, thus leading to ecologically meaningful surrogates that, while 
cost effective in reflecting community patterns, may also contribute to unveil 
underlying processes. A specific R code for BestAgg is provided. 



Introduction 

The unprecedented increase in anthropogenic disturbance 
worldwide has exacerbated concerns about the potential 
ensuing depletion of biodiversity and ecosystem 
functioning (Hooper et al. 2012). However, the intrinsic 



complexity of ecological systems largely limits our ability 
to predict their possible critical transitions toward unde- 
sirable states (Scheffer et al. 2012). Environmental impact 
assessment and monitoring, therefore, are of basic impor- 
tance in revealing the effects of human pressures and their 
interactions with natural sources of variability, detecting 
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early signals of phase shifts, and guiding subsequent 
adaptive management and mitigation strategies (Hill and 
Arnold 2012). 

Wide gaps in knowledge of phylogenetic, taxonomic, 
and functional characteristics of most species (Lomolino 
2004; Whittaker et al. 2005; Cardoso et al. 2011) make 
difficult quantifying human-driven patterns of changes 
and unveiling underlying ecological processes. Progresses 
in molecular analyses, such as DNA bar coding of organ- 
isms, are helping the process of cataloging biodiversity 
(Gross 2012), and recent developments in this field high- 
lighted the value of genetic tagging in estimating ecological 
properties of communities despite the inherent loss of taxo- 
nomic information (e.g., Fonseca et al. 2010; Yu et al. 
2012). Molecular analyses and bioinformatics, nevertheless, 
represent complementary but not alternative approaches to 
huge endeavors for research in taxonomy and autoecology 
(Wilson 2004), which are inevitable for advancing the 
knowledge of biodiversity (May 1990; Wheeler 2004; 
Wheeler et al. 2004; de Carvalho et al. 2007). 

Despite renewed efforts in the exploration of biodiver- 
sity (e.g., Snelgrove 2010; Fontaine et al. 2012) and in the 
enhancement of taxonomy and systematics (Boero 2001; 
Wilson 2003), current knowledge of species is still far 
from being exhaustive (Pereira et al. 2012) and the avail- 
ability of taxonomic expertise appears still insufficient 
(Wagele et al. 2011) to cope with the current need of 
timely solutions to pressing environmental problems. 

This so-called 'taxonomic impediment' (e.g., Wheeler 
2004) is challenging in applied ecological research to pro- 
vide cost-effective methods for elucidating the response of 
communities and ecosystems to natural and anthropo- 
genic drivers of change (Pik et al. 1999; Jones 2008; 
Mandelik et al. 2010; Mellin et al. 2011). A mainstream 
practice to overcome this hindrance across terrestrial, 
freshwater, and marine environments focuses on the use 
of higher taxa as surrogates for species (Bevilacqua et al. 
2012). The higher taxon approach in environmental inves- 
tigations is based on the concept of taxonomic sufficiency, 
which involves the use of coarse taxonomic resolution 
without causing a significant loss of information, thus 
avoiding costly, time-expensive, and difficult species- 
level identifications (Beattie and Oliver 1994). Such an 
approach, especially when based on intermediate taxo- 
nomic ranks (i.e., Genus and Family), is generally effective 
in depicting species-level patterns of community response 
under a wide range of environmental settings (e.g., Heino 
and Soininen 2007; Lovell et al. 2007; Terlizzi et al. 2009). 

However, taxonomic sufficiency implies the static 
grouping of organisms in taxa belonging to a single 
taxonomic level higher than species (e.g., all organisms 
identified as genera, or families, etc.) irrespective of 
their ecological relevance or difficulty of taxonomic 



identifications. As a consequence, the use of higher taxa 
as surrogates for species (hereafter referred to as taxo- 
nomic surrogates) often restricts inferences about the cau- 
sality of the observed patterns (Lenat and Resh 2001; 
Terlizzi et al. 2003; Jones 2008). 

Uncertainties about the appropriateness of this 
approach to species surrogacy may depend on the fact 
that related empirical studies have amassed in the absence 
of incisive efforts in structuring a solid theoretical frame- 
work for the application of taxonomic surrogates. Puta- 
tive similarities in ecological traits among closely related 
species, or hierarchical (from species to higher taxonomic 
ranks) responses to environmental disturbance, have been 
invoked to substantiate the ability of taxonomic surro- 
gates to mirror species-level patterns (e.g., Warwick 1993; 
Ferraro and Cole 1990; Heino and Soininen 2007). Such 
explanations are, nevertheless, unable to elucidate exhaus- 
tively the reasons behind the success, or failure, of taxo- 
nomic surrogates (Lenat and Resh 2001; Bertrand et al. 
2006; Dethier and Schoch 2006; Bevilacqua et al. 2009), 
and are difficult to validate experimentally. The absence 
of clearly stated assumptions on the effectiveness of taxo- 
nomic surrogates, and the lack of standard methods for 
quantifying the probability of Type-I error when identify- 
ing a particular taxonomic level as effective in discerning 
a given pattern of interest, raised criticism about their 
potential utility (Mellin et al. 2011). 

Several studies that have investigated factors affecting 
the performance of taxonomic surrogates, such as taxo- 
nomic relatedness among species, outlined that higher taxa 
perform better as surrogates for species when they are 
poor in species (e.g., Lovell et al. 2007), or there is a small 
mean and variance in the number of species per higher 
taxon (e.g., Neeson et al. 2013) or, in other words, when 
the ratio of the number of species to the number of higher 
taxa is low (e.g., Giangrande et al. 2005; Dethier and 
Schoch 2006). In a recent attempt to shed light on poten- 
tial mechanisms determining the performance of taxo- 
nomic surrogates, Bevilacqua et al. (2012), working on 
marine molluscs at a regional scale, used null models to 
show that higher taxa of the Linnaean taxonomic hierar- 
chy may be considered as arbitrary categories of species 
unlikely to convey consistent responses to natural or 
human- driven environmental changes. A similar approach, 
based on the metacommunity concept, led Siqueira et al. 
(2012) to analogous conclusions when investigating con- 
gruences in spatial patterns of variation in community 
composition of freshwater invertebrates among the whole 
set and different subset taxa. Bevilacqua et al. (2012) 
showed that information loss and the ensuing decrease in 
statistical power to detect changes in assemblage structure 
at higher taxonomic levels depended on the degree of spe- 
cies aggregation (exemplified by the ratio between the 



2013 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. 



3781 



BestAgg: a Step Forward for Species Surrogacy 



S. Bevilacqua ef al. 



number of higher taxa and the number of species), rather 
than on taxonomic relatedness of species (i.e., the relative 
closeness of species in the Linnaean taxonomic hierarchy) 
(see also Siqueira et al. 2012 for similar findings). By ana- 
lyzing 20 years of research on taxonomic surrogates, the 
authors also found strong evidence supporting the general- 
ity of such findings across a wide range of terrestrial, fresh- 
water, and marine organisms. 

In this perspective, here, we propose a novel approach 
to species surrogacy, the Best Practicable Aggregation of 
Species (BestAgg), that allows alternative ways to aggregate 
species into surrogates, beyond static taxonomic grouping, 
in order to maximizing ecological information and to 
optimizing the use of surrogates for species in ecological 
studies. Taxonomic sufficiency concerns the use of higher 
taxa as surrogates for species and aims to identifying the 
coarser level of taxonomic resolution sufficient to allow 
the assessment of community response to environmental 
drivers. The BestAgg approach, instead, relies on determin- 
ing the sufficient (i.e., minimum) number of surrogate 
groups, irrespective of their type (i.e., if taxonomic, 
morphological, functional, etc.), that could be used while 
still obtaining consistent results with species-level commu- 
nity response. As for any rigorous surrogacy approach (e.g., 
Van Wynsberge et al. 2012), taxonomic sufficiency relies 
on a first assessment of the sufficient taxonomic resolution 
based on species-level data (e.g., Terlizzi et al. 2003; Defeo 
and Lercari 2004; Jones 2008). In this framework, a pilot 
investigation compares results of analyses at species level 
with those obtained using higher taxa. Species-level data 
are therefore aggregated (i.e., grouped and summed) into 
higher taxa and the coarser taxonomic resolution able to 
provide consistent results with those obtained from spe- 
cies-level data is assumed to be suitable for subsequent 
monitoring or for very similar study contexts. 

Following the same framework, we used species-level 
information from pilot studies to identify the sufficient 
number of surrogates able to depict community patterns 
consistently with species-level information. Surrogates 
were then defined based on their ecological importance 
(relevance), low difficulty of taxonomic identification dur- 
ing sample processing (easiness), and shared characteris- 
tics among organisms (resemblance). Finally, we tested the 
performance of BestAgg surrogates in similar study con- 
texts and compared their response with classic surrogates 
based on taxonomy (i.e., higher taxa). 

Methods 

The BestAgg: assumptions and rationale 

The identification of the sufficient number of surrogates 
is based on a null model assuming that surrogates may be 



considered as random groups of species from the original 
species pool found in the study (see Bevilacqua et al. 
2012 for a full theoretical discussion). Specifically, the 
model assumes that (1) the ability of surrogates to exhibit 
the same community response detectable at species level 
depends on the residual information retained in the 
aggregated data. This residual information can be 
expressed as the Spearman's correlation (p) between the 
species-level data matrix and the corresponding aggre- 
gated matrix (Somerfield and Clarke 1995). The bulk of 
evidence on the application of taxonomic surrogates 
widely supports this assumption (Bevilacqua et al. 2012). 

However, while for taxonomic sufficiency the coher- 
ence in community responses between species and higher 
taxonomic ranks would originate from the putative eco- 
logical similarities among species within higher taxa (e.g., 
Warwick 1993), the rationale underlying BestAgg is that 
it is the level of aggregation that matters or, in other 
words, the number of surrogates in which species are 
aggregated, irrespective of underlying aggregation criteria. 
In this view, it is assumed that (2) the residual informa- 
tion in the aggregated matrix (p) depends on the level of 
aggregation, which can be expressed as 0 = G/S, that is, 
the ratio of the number of surrogates groups G in which 
species are aggregated to the number of species S in the 
original species-level matrix. Such a dependence between 
p and f/> has been largely confirmed analyzing the scien- 
tific literature on taxonomic surrogates, which showed 
that the decrease in p at decreasing r/> follows a semilog 
model consistent across different organisms and habitat 
types (Bevilacqua et al. 2012). Under the two aforemen- 
tioned assumptions, the effectiveness of a given set of 
surrogates for species (i.e., their ability to emulate spe- 
cies-level community response) depends on the level of 
aggregation (j>- 

The BestAgg approach consists of two main steps. First, 
a null model based on random aggregations of species- 
level data is built to determine the value of </>, namely, 
4>i ow (i.e., the lowest practicable aggregation level), suffi- 
cient to obtain results consistent with those obtained 
analyzing species-level data. Second, the study-specific 
surrogate groups are identified on the basis of r/> low and a 
set of selection criteria aiming to capitalize on the ecolog- 
ical information in the aggregated matrix. 

Null model based on randomly aggregated 
matrices 

Let M s be a matrix S x N of any type of abundance data 
collected at species level from any assemblage, where S is 
the number of species and N the number of samples. The 
aggregated matrix derived from M s is a G x N matrix in 
which all the original S species have been assigned to G 
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groups (with G < S) and their abundance summed. The 
level of aggregation of this matrix is defined as 0 = G/S. 

The whole procedure for the construction of the null 
model is synthesized in Figure 1. The aim of the 
approach is to identify the lowest 4> value at which (1) 
there is consistently a strong correlation between the spe- 
cies and surrogate dissimilarity matrices, regardless of 
how species are aggregated into surrogate groups, and (2) 
the probability of statistical tests to fail in detecting signif- 
icant differences for the term of interest in the analysis 
when using the corresponding number of surrogates is 
P < 0.05. 

For the construction of the null model, the S species in 
the original matrix M s are randomly aggregated in a 
decreasing number of G groups in order to simulate 
decreasing values of </>. The number of groups G is pro- 
gressively decreased, starting from G = S, by stepwise 
reductions of a fixed decrement d. Decrements of 
d = 10~' x S (i.e., 10% of S) allow defining a representa- 
tive set of simulated decreasing (j) values for a wide range 
of S, and are therefore suitable in most studies concerning 



species surrogacy (Bevilacqua et al. 2012). If necessary, 
that is, in the analyses of speciose assemblages, decre- 
ments can even be set to 5% of S to avoid excessive gaps 
in the progressive reduction in G. 

For each set of G groups obtained from the stepwise 
reduction procedure, random aggregations are repeated 
1000 times, obtaining 1000 random aggregated matrices 
for each corresponding value of <f>. A triangular matrix 
based on any measure of resemblance (e.g., Jaccard, 
Sorensen, Bray-Curtis, etc.) can be obtained from the ori- 
ginal species-level matrix M s and from each aggregated 
matrix. Then, for each aggregated triangular matrix, the 
correlation value p with the triangular M s is calculated. 
Next, a distance-based permutational multivariate analysis 
of variance (PERMANOVA, Anderson 2001) is performed 
based on each randomly aggregated matrix to test for the 
term of interest in the analysis (e.g., significant effects of 
the investigated factor, such as a natural or human- 
induced environmental gradient, on multivariate assem- 
blage structure). PERMANOVA has been used as default 
multivariate statistical test because it allows analyzing 
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Figure 1. Construction of the null model for the identification of the lowest practicable level of aggregation (i/>i ow ) in BestAgg. The S variables 
(e.g., species, taxa, groups, etc.) in the original S x N matrix (M s ) are aggregated (i.e., grouped and summed) at random into a progressively 
decreasing number of G,- variables following a stepwise reduction in fixed percentage detriments d (details in the text). For each G,*, random 
aggregations of the original S variables are repeated, obtaining 1000 randomly aggregated G,- x 5 matrices (Ms!) with a level of aggregation </>,. 
The level of aggregation for a given matrix is defined as </>, = G/S. For each randomly aggregated matrix and for the original matrix (i.e., M s ), the 
corresponding triangular matrix based on any distance measure of choice is obtained. A distance-based multivariate analysis of variance 
(PERMANOVA) (see text for further details) is performed based on all triangular matrices (the original one and all the derived randomly 
aggregated matrices) to test for the factor of interest in the experiment. The Spearman's correlation p between the triangular M s and each of the 
1000 triangular M Gj is also calculated. Finally, a semilog model of p against ln(0) based on results is fitted in order to identify the lowest <f> value 
at which >95% of PERMANOVA tests on randomly aggregated matrices provides results consistent with those obtained analyzing the original 
data matrix (i.e., i/>| ow ). 
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complex multifactorial designs and testing for interaction 
terms (in contrast to, for instance, analysis of similarities 
(ANOSIM), which allows analyzing only two-factor 
designs and does not test for interactions) (ANOSIM, see 
Clarke 1993), using any distance measure and overcoming 
problems related to non-normality of data (in contrast to, 
for instance, classic MANOVA, which requires normality 
and implicitly uses Euclidean distance) (see Legendre and 
Anderson 1999; Anderson 2001; McArdle and Anderson 
2001). 

The dependence between 4> and the effectiveness of the 
corresponding surrogate groups, which is the logic deriva- 
tion of model assumptions (1) and (2) (see previous 
Methods section), can be now formally checked by fitting 
a semilog model of p values against the corresponding r/> 
values and calculating, for each level of aggregation <j>, the 
percentage of significant PERMANOVA tests for the term 
of interest, out of n = 1000, consistent with results obtained 
at species level. Significant results from analyses on aggre- 
gated data are considered as consistent with those obtained 
at species level if a G < a s , where a s is the significance level of 
the test for the term of interest (i.e., a s = 0.05, if 
0.01 < P < 0.05; S£ S = 0.01, if 0.001 < P < 0.01; oc s = 0.001, 
if P < 0.001) based on species-level data, and a G is the signif- 
icance level of the same test from the analysis on aggregated 
data. 

Finally, the lowest practicable aggregation level 0i ow is 
determined by identifying the lowest <p value allowing the 
95% of PERMANOVAs on randomly aggregated matrices 
to give results consistent with those obtained at species 
level. Given that </> = G/S, it is possible to derive the suffi- 
cient (or, in other words, the minimum) number of 
surrogate groups G m j n = </>k>w x S needed to obtain con- 
sistent results to those obtained analyzing species-level 
data. In such cases we can reject the null hypothesis that 
G min is not sufficient to allow consistent results with spe- 
cies-level analysis with a probability of Type-I error of 
P < 0.05, under the assumption that surrogate groups are 
random subsets of species. It is worth noting here that 
G m in represents a threshold value. The number of surro- 
gate groups that could be employed is not required to be 
necessarily equal to G mm , but just to be within the range 
of sufficient aggregation levels for analyses or, in other 
words, >G min . 

The R code for analyses along with a brief user manual 
is provided (see Appendices SI and S2). Example data are 
also supplied (see Appendices S3 and S4). 

Selecting surrogate groups for BestAgg 

Once G min is set, the subsequent step concerns the selec- 
tion of surrogates. Formally, it could be virtually criterion 
free, as species could be grouped randomly within the 



G m in surrogate groups. However, the approach does not 
intend to legitimate random aggregations of species, but, 
rather, to achieve an aggregation of species coherent with 
the aim of the study and the knowledge of the investi- 
gated system. 

The concept underlying BestAgg is to exploit simulta- 
neously the potential of different surrogate types in pro- 
viding cost-effective assessments of community response. 
The approach, while fixing the sufficient number of sur- 
rogates that could be used, unleash the investigator from 
static surrogacy schemes, allowing the selection of any 
surrogate for species potentially leading to retain ecologi- 
cal information and/or to reduce efforts for the identifica- 
tion of organisms and sample processing. One or more 
species may be selected to form a surrogate following the 
logic of three unifying macrocriteria, to which any other 
single selection criterion can be ascribed: relevance, easi- 
ness, and resemblance. Relevance concerns the importance 
of a given species, taxon, or group of organisms from an 
ecological perspective, whether general (e.g., keystone spe- 
cies, habitat formers, bioengineers, conservation targets), 
context-specific (e.g., tolerant, sensitive, indicator species, 
or taxa), or study-specific (e.g., species, taxa, or group of 
organisms most contributing to the observed patterns). 
Easiness relates to the distinctiveness of a given species, 
taxon, or group of organisms leading to be easily identi- 
fied from a taxonomic, morphological, or functional 
point of view, even by nonexpert taxonomists. Finally, 
resemblance concerns shared characteristics among organ- 
isms, from common ancestry to functional similarity 
(e.g., phylogenetically/taxonomically related species, 
trophic groups) that allow meaningful groupings. 

The process of surrogate selection for BestAgg is guided 
by the interplay among the characteristics of relevance, 
easiness, and resemblance of each species (Fig. 2). High 
priority is given to relevant species that are also easy to 
identify. Such species are directly selected as surrogates 
(Fig. 2). Relevant species whose identification is difficult 
are aggregated, if possible, in easy-to-identify but still rel- 
evant surrogates; otherwise such species are joined with 
not relevant ones (Fig. 2). Intermediate priority is given 
to such surrogates because their easiness is achieved 
through resemblance (i.e., through aggregations of species 
based on their similarities, such as, for instance, morpho- 
logical, functional, etc.). Finally, not relevant species are 
grouped to form surrogates following any appropriate 
aggregation criterion (Fig. 2). Also in this last case easi- 
ness is achieved through resemblance but, as surrogates 
mostly include not relevant species, low priority is given 
to them. 

Aggregations leading to low-priority surrogates should 
increase easiness taking into account, nevertheless, that at 
the end of the selection process the number of selected 
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Figure 2. A conceptual framework for 
surrogate selection in BestAgg. High-priority 
surrogates are single species (or taxa, groups, 
etc.) owning the characteristics of relevance 
and easiness (REL + E). Medium-priority 
surrogates are aggregations of species (or taxa, 
groups) owing the characteristic of relevance, 
but achieving easiness through resemblance 
(REL + E -» RES). Low-priority surrogates are 
aggregations of not relevant species (or taxa, 
groups) achieving easiness through 
resemblance (E -> RES). 

surrogates for BestAgg, namely, G Be stAgg. has to be >G m i n . 
It is worth noting here that G m i n represents the effective 
minimum number of surrogates needed. Therefore, if 
GsestAgg > G min , it should not be reduced necessarily to 
G m ; n , especially if G BestAgg « G min , given that further 
aggregations would have very little effects in reducing 
efforts of sample processing (e.g., sorting, counting, iden- 
tification of organisms). This could not be the case if 
GBestAgg >;> G m i n . In such situations, further aggregations 
of low-priority surrogates first and, subsequently, if neces- 
sary, of medium- and high-priority surrogates may be 
used to reduce the number of selected surrogates. 

Any species, taxon, or group of organisms matching 
one or more of the above macrocriteria is potentially eli- 
gible as surrogate. BestAgg, however, aims to optimizing 
trade-offs between relevance, easiness, and resemblance in 
order to maximizing the relevant ecological information 
while reducing as much as possible the number of surro- 
gates and the difficulty in the identification of organisms. 
In this view, the selection process of surrogates in BestAgg 
is structured following two basic principles: (1) prioritiz- 
ing the choice of surrogates which are at the same time 
ecologically relevant and easy to identify; (2) aggregating 
difficult species into surrogates easier to identify (Fig. 2). 



Checking the surrogate set from BestAgg 

The effectiveness of BestAgg surrogates can be checked by 
comparing results from analyses based on the data matrix 
in which the original species have been aggregated into 
the BestAgg surrogates with those obtained analyzing the 
original species-level data matrix. Multivariate community 
responses using BestAgg surrogates and species should be 
interchangeable. Moreover, the information retained in 
the BestAgg aggregated matrix should be within random 
expectations from the null model. This is because BestAgg 
assumes that surrogates are random subsets of the origi- 
nal pool of species and, therefore, selected surrogates 
should work at least as well as random expectations. 
Indeed, the condition in which the selected surrogates 
allow retaining significantly greater information on spe- 
cies-level patterns than random expectations is even more 
desirable, as this would mean that the selection procedure 
has led to a set of surrogates that are able to represent 
species-level community patterns even better than what is 
expected by chance. To check for this, a test based on 
randomizations can be performed (see Appendix S2 for 
details). In practice, the original S species are randomly 
aggregated in G Bes tAgg groups, where G Be stAgg is the 
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number of BestAgg surrogates obtained from surrogate 
selection (see Fig. 2). Random aggregations are repeated 
1000 times. Correlation values (p ; ) between the original 
species-level matrix and each i randomly aggregated 
matrix are then calculated to obtain a frequency distribu- 
tion against which testing PBestA gg > that is, the correlation 
between the original species-level matrix and the matrix 
aggregated using BestAgg surrogates. Also, PERMANOVA 
is performed on randomly aggregated matrices, and the 
percentage of tests for the term of interest showing con- 
sistent results with those obtained at species level is calcu- 
lated, representing the P-value (i.e., the probability of 
Type-I error) for G Bes tAgg- 

When the above conditions are respected, that is, (1) 
results using BestAgg surrogates are consistent with those 
obtained using species, (2) PBestAgg falls within or above 
random expectations, and (3) the probability of Type-I 
error is <0.05, the information provided by BestAgg sur- 
rogates may substitute effectively species-level information 
in subsequent sampling programs or very similar case 
studies. 

Application of BestAgg to real data 

We applied BestAgg to two real case studies including dif- 
ferent habitats, organism types, and environmental set- 
tings, in order to assess the effectiveness of the approach. 
The first case study, hereafter OP, focused on impact 
assessment of offshore gas platforms on soft bottom 
macrobenthic assemblages. This study included species- 
level data from 2 eight-leg platforms (namely, PI and P2) 
located on mud flats at 90 m depth, in the same 
geographic area (North Ionian Sea). For each platform, 
macrobenthic assemblages were sampled at increasing 
distance from its hard structure (i.e., 300, 1000, and 
3000 m), in five sites for each distance. The second case 
study, hereafter DG, related to assessing patterns of varia- 
tions in sessile assemblages along a depth gradient in 
Mediterranean rocky cliffs (South Adriatic Sea). This case 
study included data on species, but also higher taxa and 
morphological groups, from a 2-year monitoring program 
involving four times of sampling (Tl, T2, T3, and T4). In 
each time, sessile assemblages were sampled at three 
depths (i.e., 5, 15, and 25 m) in four locations, with three 
sampling sites in each location. Additional information 
on the data sets is provided in Table SI. 

The effectiveness of the BestAgg approach was assessed 
following two steps: (1) the BestAgg procedure was 
applied to PI and Tl, which served as pilot assessments 
of BestAgg surrogates for the OP and the DG case study, 
respectively; (2) BestAgg surrogates from pilot assess- 
ments were used to analyze data from the second plat- 
form (i.e., P2) for OP, and from the remaining sampling 



times (i.e., T2, T3, and T4) for DG, which were used as 
test studies in order to check whether such surrogates 
were effective under comparable environmental settings. 

Finally, the performance of BestAgg was also compared 
with results obtained applying a more classical approach 
for species surrogacy based on the concept of taxonomic 
sufficiency. Specifically, we assessed the amount of infor- 
mation on species-level community patterns retained in 
matrices aggregated using taxonomic surrogates versus 
BestAgg surrogates by applying the randomization test 
described above. In addition, we estimated savings 
derived from the application of BestAgg and taxonomic 
surrogates following the approach proposed by Ferraro 
and Cole (1995) (see also Thompson et al. 2003). In this 
approach, savings are estimated by taking the ratio of the 
number of surrogates to the number of species, assuming 
that the time spent to identify organisms is proportional 
to the number of categories (e.g., species, higher taxa, 
groups) to which they must be assigned. 

Statistical analyses 

Two separated null models based on randomly aggregated 
matrices were constructed employing data matrices from 
pilot studies, that is, PI (OP) and Tl (DG), following the 
procedure described previously. Analyses were done using 
the R code provided in Appendix SI (see also Appendix 
S2). The decreasing 4> values were obtained by setting d 
to 5% of S for PI (very speciose assemblages, see above) 
and to 10% of S for Tl. For the construction of null 
models, and for subsequent multivariate analyses, 
PERMANOVA was employed to test for significant effects 
of the investigated environmental drivers (i.e., distance 
from platform for OP and depth for DG) on assemblage 
structure. All analyses were based on Bray-Curtis dissimi- 
larities on untransformed data, using 2000 permutations 
(see Table SI for details on designs for analyses). 

For each pilot assessment, <pi ow and, consequently, 
G m i n , were identified and a linear regression of p values 
from random aggregations against the corresponding In 
(</>) was fitted. For each case study, the specific set of 
BestAgg surrogates based on pilot assessments was 
determined following the procedure described for surro- 
gate selection (see Fig. 2). Study-specific relevant species 
(or taxa, groups) most contributing to the observed 
community patterns were identified through similarity 
percentage analysis (SIMPER, Clarke 1993); only species 
whose contribution to dissimilarities was >3% were 
selected. 

For each pilot assessment, results from PERMANOVA 
on species-level data were compared to those obtained 
from analyses based on data aggregated using BestAgg 
surrogates to check for their effectiveness. The related test 
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based on random aggregations was performed to check 
for the amount of information retained in the BestAgg 
aggregated matrix (see above). 

Species-level data from test studies, namely, P2 (OP) 
and T2, T3, and T4 (DG), were then aggregated using the 
set of BestAgg surrogates from the corresponding pilot 
studies. Finally, PERMANOVA was carried out on data 
aggregated using BestAgg surrogates and results compared 
with those from analyses at species level. 

Data from PI and Tl were also used for a pilot assess- 
ment of the sufficient taxonomic level for analyses, fol- 
lowing the logic of taxonomic sufficiency. Therefore, the 
original species-level matrices were aggregated in taxa of 
single ranks of the Linnaean taxonomic hierarchy. Then, 
PERMANOVA on taxonomically aggregated data matrices 
was carried out to identify the sufficient taxonomic reso- 
lution needed to detect significant effects of the investi- 
gated environmental driver. Null models and the related 
test based on randomizations from BestAgg were used to 
check for the effectiveness of such taxonomic surrogates 
and the information retained in higher taxon matrices. 
Finally, test data for each case study (i.e., P2 for OP, and 
T2, T3, T4, for DG) were analyzed at the respective suffi- 
cient taxonomic level, and results were compared with 
those obtained applying the BestAgg approach. 

Analyses were performed using R (R Development Core 
Team 2010). 

Results 

For both pilot assessments, linear regressions of p against 
ln(0) were significant (P < 0.001), indicating that the 
information retained in the aggregated matrices strongly 
depended on the level of aggregation following a semilog 
model (Fig. 3). 

For the OP case study, the pilot assessment showed 
that the lowest <j> value allowing 95% of PERMANOVAs 
on aggregated data to give consistent results with those 
obtained at species level was 4>i ow = 0.10, corresponding 
to G m i n = 26 (Table 1). For the DG case study, instead, 
0iow = 0.20 and, therefore, G min = 16 (Table 1). This 
means that the original S species, that is, 259 for OP and 
79 for DG could be aggregated in 26 and 16 surrogates, 
respectively, while still allowing analyses to perform as 
well as at species level. 

The procedure for selection of BestAgg surrogates from 
pilot assessments led to define a set of G Bes , Agg = 29 sur- 
rogates for OP (see Table S3) and G Be stAgg = 23 surro- 
gates for DG (see Table S4). General and context-specific 
ecological relevance of surrogates was defined based on 
available scientific information (see Tables S3 and S4), 
whereas study-specific relevance was defined based on 
species most contributing to the observed patterns (see 
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Figure 3. Semilog plot of p values between the species-level matrix 
and each randomly aggregated matrix against the corresponding ij> 
values for pilot studies (A) P1 (OP) and (B) T1 (DG). Fading gray zones 
indicate the range of <f> values at which analyses were consistent with 
those at species level. Dotted lines indicate <^| ow (i.e., the lowest 
practicable aggregation level), sufficient to obtain results consistent 
with those obtained analyzing species-level data. 



Table S2 for results of SIMPER analyses). For OP, all 29 
surrogates were based on taxonomy, with six species 
(Aspidosiphon sp., Corbula gibba, Golfingia sp., Thyasira 
biplicata, Timoclea ovata, and Nuculana commutata), three 
genera (Kelliella, Diplodonta, and Nucula), four families 
(Capitellidae, Cirratulidae, Paraonidae, and Spionidae), 
five orders (Amphipoda, Cumacea, Decapoda, Isopoda, 
and Tanaidacea), 10 classes (Aplacophora, Asteroidea, 
Bivalvia, Echinoidea, Gastropoda, Holothuroidea, Poly- 
chaeta, Ophiuroidea, Scaphopoda, and Turbellaria), and 
one phylum (Sipuncula) (Table S3). For DG, the set of 
23 BestAgg surrogates was more heterogeneous including 
11 taxonomic surrogates from species to class level 
(Agelas oroides, Anthozoa, Axinella sp., Bivalvia, Cirripe- 
dia, Cladocora caespitosa, Cliona spp., Hydrozoa, Peysson- 
nelia spp., Tunicates, and Wrangelia penicillata), and 12 
morphological/functional groups (calcareous tube worms, 
canopy-forming algae, coarsely branched/unbranched 
algae, Crambe/Spirastrella, encrusting Bryozoans, erect 
Bryozoans, encrusting calcified Rhodophytes, encrusting/ 
massive sponges, green filamentous algae, Madreporari- 
ans/Zoanthidea, massive black sponges, and turf-forming 
Algae) (Table S4). 
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Table 1. Percentage of tests from PERMANOVA on random aggregated data consistent with those from species-level analyses, at decreasing 
levels of aggregation (<f>). 



OP - Pilot assessment (Platform P1 S = 259) DG - Pilot assessment (Time 1 S = 79) 



Number of 


Aggregation 


% Analyses consistent 


Number of 


Aggregation 


% Analyses consistent 


surrogates (G) 


ratio ((/>) 


with species level 


surrogates (G) 


ratio {</>) 


with species level 


247 


0.95 


1 00% 


72 


0.90 


100% 


234 


0.90 


1 00% 


64 


0.80 


1 00% 


221 


0.85 


1 00% 


56 


0.70 


1 00% 


208 


0.80 


100% 


48 


0.60 


1 00% 


195 


0.75 


100% 


40 


0.50 


99% 


182 


0.70 


1 00% 


32 


0.40 


99% 


169 


0.65 


100% 


24 


0.30 


99% 


156 


0.60 


100% 


16 


0.20 


98% 


143 


0.55 


100% 
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0.10 


84% 


130 


0.50 


100% 
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100% 








78 


0.30 
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1 00% 








52 


0.20 


100% 








39 


0.15 


100% 








26 


0.10 


98% 








13 


0.05 


69% 









The corresponding number of surrogates (G) is also provided. The lowest practicable aggregation (4>)iow ar| d the corresponding minimum number 
of surrogates G min are given in bold. 



For both pilot assessments, tests based on randomizations 
showed that the probability of G BestAgg to fail in depicting 
species-level community patterns was P = 0.021 for OP and 
P = 0.005 for DG. Correlation PBestAgg was m both cases 
significantly {P < 0.05) higher than random expectations 
(Fig. 4), indicating that data aggregated using BestAgg sur- 
rogates retained much more of the original species-level 
information than what is expected to occur by chance. 

Pilot assessments showed that PERMANOVA based on 
data aggregated using BestAgg surrogates allowed obtaining 
the same results of species-level analyses (Table 2). For OP, 
the variance component associated to investigate source of 
variation (i.e., distance from platform) accounted for 22% 
of the total variance at species level and 24% when using 
the BestAgg surrogates. For DG, the variance component 
associated to investigate source of variation (i.e., depth) 
accounted for the 26% of the total variance at species level 
and 27% when using BestAgg surrogates. Analyses on 
aggregate data were able to detect not only the main effect 
of the investigated sources of variation but also to depict 
consistently species-level patterns of difference in commu- 
nity structure along the studied gradients (Table 2). For 
both case studies, PERMANOVA on test data using the 
specific BestAgg surrogates derived from pilot assessments 
showed results consistent with those that would have been 
obtained if species were analyzed (Table 2). 



For both case studies, the information retained in taxo- 
nomically aggregated matrices fell within or below the 
95% confidence interval from random expectation for all 
investigated taxonomic levels (i.e., genus, family, order, 
class, phylum) (Fig. 5), indicating that taxonomic surro- 
gates behaved as, or even worse, than random groups of 
species. Pilot assessments of taxonomic surrogates based 
on taxonomic sufficiency indicated order and phylum, 
respectively, as the sufficient taxonomic levels for analyses 
in OP and DG (Table 3). In contrast, the sufficient taxo- 
nomic level predicted based on the lowest practicable 
aggregation (0i ow ) was that of order for both case studies 
(Table 3). For OP, PERMANOVA on test data aggregated 
at order level confirmed this taxonomic resolution as suf- 
ficient in providing results consistent with those obtained 
using species (Table 4). For DG, analyses showed that 
orders were effective surrogates, whereas the analysis at 
phylum level, although still detecting main effects, was 
unsuitable to depict community pattern of difference 
along the investigated environmental gradient as well as 
at species level (Table 4). 

The application of BestAgg surrogates led to an esti- 
mated saving of time during sample processing and 
organism identification of 90% for OP and 71% for DG, 
in contrast to savings of 85% and 45% for OP and DG, 
respectively, when using taxonomic surrogates. 
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pragmatic solution to overcome impediments related to 
fine taxonomic identifications of organisms in ecological 
studies (Bevilacqua et al. 2012). However, a number of 
issues on taxonomic surrogates remained largely unsolved, 
possibly preventing the consolidation of such a practice 
in routine monitoring programs despite its undeniable 
advantages (Dauvin et al. 2007). The use of taxonomic 
surrogates is problematic when the allocation of species 
into higher taxa is queried, or when cladistic revisions of 
the taxonomic hierarchy lead to the insertion/removal of 
additional (e.g., infraorder, superfamily) or classic ranks. 
Also, taxa of the same rank may not be equivalent from a 
phylogenetic point of view among different phyla, making 
the use of taxonomic surrogates less stringent when con- 
sidering assemblages embracing more than one phylum 
(Bertrand et al. 2006; Bevilacqua et al. 2009). More 
importantly, ecological similarity among species within 
taxa may be markedly taxon specific (e.g., Losos 2008), 
hampering the association of a clear ecological meaning 
to changes in community structure when it is codified 
through ranks of the Linnaean hierarchy higher than 
species (Somerfield and Clarke 1995; Terlizzi et al. 2003; 
Bertrand et al. 2006; Jones 2008). 

In spite of these evident intrinsic limits, approaches 
based on taxonomic relationships have profoundly condi- 
tioned the way species surrogacy is conceived so far. The 
BestAgg framework proposed in this study attempts to 
rise above this stagnant perspective. BestAgg focuses on 
the aggregation of variables in multivariate data matrices, 
looking at the effect of aggregation on congruencies 
between the information contained in the original versus 
the corresponding aggregated matrix (Bevilacqua et al. 
2012). The approach is based on the simple concept that 
the higher the level of aggregation (i.e., the ratio of the 
number of aggregated variables to the number of original 
variables) the higher the loss of information. As it is the 



Table 2. Results of PERMANOVA on data aggregated using BestAgg surrogates. Results consistent (including pairwise comparisons) with those 
obtained at species level (which are also reported) are given in bold. 



BestAgg pilot assessment 










Case study 
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Source of variation 
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P1 
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(W ± M ± F)*** 


(W ?£ M st F)*** 


DG 


T1 


Depth 


(5^15 = 25)*** 


(5 + 15 = 25)*** 


Application of BestAgg 










Case study 


Test study 


Source of variation 


Species level 


BestAgg 


OP 


P2 


Distance 


(W = M ± F)** 


(N = M + F)** 


DG 


T2 


Depth 


(5 ^ 15 = 25)* 


(5 # 15 = 25)* 




T3 


Depth 


(5^15/ 25)** 


(5 * 15 # 25)** 




T4 


Depth 


(5 4- 15 25)** 


(5 # 15 # 25)** 



P1 , Platform 1 ; P2, Platform 2; N = 300 m; M = 1 000 m; F = 3000; T1 -2-3-4, Times 1 -2-3-4. 
*P < 0.05; **P < 0.01; ***P < 0.001 . 
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Discussion 



Over the past three decades, the use of higher taxa as sur- 
rogates for species has received increasing attention as a 
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Figure 4. Frequency distribution (n = 1000) of p values between the 
species-level matrix and matrices in which species were randomly 
aggregated in G Bes tAgg groups (see text and Fig. 2), for pilot studies 
(A) P1 (OP) and (B) T1 (DG). Dotted lines indicate p B estAgg, that is, the 
correlation value between the species-level matrix and the matrix 
aggregated using BestAgg surrogates, which in both cases fall 
significantly (P < 0.05) above random expectations. 
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morphological groups, etc.) and the logic guiding vari- 
ables' aggregation are irrelevant. Thus, BestAgg is applica- 
ble to any kind of community data from any 
environmental context and type of organisms (whether 
involving a single phylum or more different ones), allows 
mixing any type of surrogates (as the identification of the 
sufficient number of surrogates goes beyond any potential 
relationship among species, whether taxonomic, phyloge- 
netic, etc.), and prioritizes the choice of ecologically 
meaningful groupings (in contrast to taxonomic surro- 
gates, for instance, which are based on taxonomic related- 
ness regardless of whether higher taxa could actually 
represent ecologically meaningful units). 

It could be argued that such numerical relationships 
might be biased by sample size, as might happen for 
ratios between taxonomic categories/subcategories (Gotelli 
and Colwell 2001). This is not the case for aggregation 
ratios in BestAgg because (1) the approach does not 
assume any intrinsic relationship between the original and 
the aggregated variables (which, actually, are arbitrary cat- 
egories deriving from random aggregations) and (2) sam- 
ple size is constant for a given study. 

Disentangling species surrogacy from static aggregation 
schemes, BestAgg, can also take advantage of using differ- 
ent surrogate types (e.g., higher taxa, functional groups, 



Table 3. <f> values of each taxonomic level and sufficient <j> values based on BestAgg (pilot studies). 









Values of <j> based on taxonomic aggregation 






BestAgg prediction 


Sufficient 


Case 


Pilot 


Source of 






Sufficient <f> 


of sufficient 


taxonomic 


study 


study 


variation 


S Species Genus Family Order Class 


Phylum 


from BestAgg 


taxonomic level 


level (classic) 


OP 


P1 


Distance 


259 1*** 0.76*** 0.48*** 0.15*** 0.06 ns 


0.02 ns 


0.10 


Order 


Order 


DG 


T1 


Depth 


79 1** 0.96** 0.77** 0.55** 0.25* 


0.13** 


0.20 


Order 


Phylum 



The sufficient taxonomic resolution for analyses based on taxonomic sufficiency (classic approach) and the sufficient taxonomic resolution 
predicted on the basis of the sufficient <p (i.e., (/>| ow ) from BestAgg are provided. The number of species S in the original matrix and results of 
PERMANOVA for each taxonomic level are also reported. P1, Platform 1; T1, Time 1. ns, not significant. *P < 0.05; **P < 0.01; ***P < 0.001. 



Table 4. Results of PERMANOVA on test data aggregated on the basis of the sufficient taxonomic level determined using taxonomic sufficiency 
(classic approach) and the lowest practicable aggregation level <jl| ow from BestAgg (see Table S5). 

Case study Source of variation Taxonomic level Test study 



OP Distance P2 

Species (W = M / F)** 

Order (W = M j£ f)** 

DG Depth 72 73 74 

Species (5/15 = 25)* (5/15/ 25)** (5 / 15 / 25)** 

Order (5 + 15 = 25)* (5^15^25)** (5^15^25)** 

Phylum (5 = 15 / 25)* (5 / 15 = 25)** (5 / 15 = 25)* 

For OP, both approaches indicated the level of order as sufficient. For DG, the classic approach and <t> lovj indicated phylum and order, respectively, 
as the sufficient taxonomic levels. Results consistent (including pairwise comparisons) with those obtained at species level (which are also reported) 
are given in bold. P2, Platform 2; W = 300 m; M = 1000 m; F= 3000; T2-3-4, Times 2-3-4. *P < 0.05; **P < 0.01; ***P < 0.001. 



numerical relationship between original and aggregated 
variables that matters, the nature of variables 
(which could express the abundance of species, taxa, 
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Figure 5. Mean ± 95% confidence interval (n = 1000) of p values 
between species and randomly aggregated matrices for (A) P1 pilot 
study (OP case study) and (B) T1 pilot study (DG case study). Black 
points are p values between species and higher taxon matrices at 
genus, family, order, class, and phylum level. Numbers in brackets 
indicate the number of taxa in each taxonomic level. 
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ecological indicators, etc.). Such a prerogative is decisive 
to open species surrogacy to ecological knowledge (Groc 
et al. 2010), which can guide the choice of those surro- 
gates more aligned to ecological characteristics of species 
in order to maximizing ecological information on com- 
munity patterns notwithstanding the inherent reduction 
in taxonomic detail. Although this aspect seems to intro- 
duce some level of subjectivity in the approach, surrogates 
selection in BestAgg is far from being arbitrary. The num- 
ber of effective surrogates is determined objectively and 
the identity of surrogates is determined based on objective 
macrocriteria. Also, evidence from pilot assessments and 
the solid scientific information on the investigated system 
substantiate the choice of surrogates, limiting the subjec- 
tivity of the experimenter (Dauvin et al. 2007). Moreover, 
in contrast to other approaches, within the BestAgg 
framework, the experimenter can set a priori the proba- 
bility of failing in detecting significant results using the 
selected surrogates and therefore controlling for uncer- 
tainty on their application. 

Quite intuitively, the set of surrogates from BestAgg 
may be strictly context specific because their choice, as for 
any other approach to species surrogacy, depends on the 
aim of the study, the particular environmental situation, 
the organisms involved, and the available ecological 
knowledge of the system. Pragmatic considerations seem 
to suggest that levels of aggregation up to 0.4-0.5 (corre- 
sponding to a number of surrogates equal to 40-50% of 
the original number of species) are usually still conducive 
to effective representations of species-level community 
patterns (Bevilacqua et al. 2012). However, one-fit-all 
solutions in species surrogacy could be misleading and the 
identification of suitable surrogates for a given study needs 
to be based on representative pilot assessments at species 
level (Terlizzi et al. 2003; Jones 2008; Siqueira et al. 2012). 
Therefore, the set of effective surrogates obtained from the 
application of BestAgg to a pilot study should be applied 
to subsequent studies in very similar environmental con- 
texts only (e.g., same source of impact in the same habitat, 
the same natural gradient in areas of the same region, 
etc.), and given the same experimental design, which, 
clearly, needs to be appropriately planned to assess the 
effects of the investigated source of variability in modify- 
ing community patterns. We simulated the application of 
BestAgg to real case studies, in which a first pilot assess- 
ment was performed to define the set of effective surro- 
gates that was then used successfully in similar 
environmental investigations (as in the OP example), or 
for subsequent monitoring programs (as in the DG exam- 
ple). Results demonstrated the robustness of BestAgg in 
analyzing community patterns in relation to both natural 
and human- driven gradients, whether involving individual 
or colonial species, although further efforts are required to 



extend this approach to other environmental contexts. As 
the estimation of cost savings deriving from using surro- 
gates strongly depends on the investigated group(s) of 
organisms, the number of specimens to be classified, and 
available taxonomic expertise (Ferraro and Cole 1990), 
quantifying the advantages provided by BestAgg surrogates 
in term of costs with respect to classic taxonomic surro- 
gates is a difficult task and estimated cost savings could 
not have a general validity. However, our results on real 
case studies showed that BestAgg surrogates might lead 
gaining up to 25% of time during sample processing with 
respect to classic taxonomic surrogates. Moreover, such a 
time saving is likely to be underestimated because, in con- 
trast to taxonomic surrogates, which imply at least a basic 
taxonomic expertise, the choice of surrogates in BestAgg 
prioritizes identification easiness and might involve non- 
taxonomic surrogates (e.g., morphological groups). 

Above all, our findings showed that BestAgg represents 
a valuable alternative method to species surrogacy in 
environmental impact assessment and ecological monitor- 
ing, potentially leading to increased time saving with 
respect to traditional approaches, such as those involving 
the use of higher taxa as surrogates for species. In addi- 
tion, BestAgg recognizes the need for conferring an eco- 
logical meaning to surrogates, which is fundamental for 
the interpretation of ecological patterns. It is increasingly 
evident that the quest for effective proxies for species has 
to abandon static approaches, moving toward the integra- 
tion of taxonomic, phylogenetic, and functional aspects 
(Devictor et al. 2010). BestAgg may represent a step 
forward in this direction. 
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